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ABSTRACT 

A DNA copy of the mRNA that encodes the nucleocapsld protein of Mouse 
Hepatitis Virus JHM has been cloned into pAT153. The DNA copy specifically 
inhibited the synthesis hi vitro of the nucleocapsid protein. The cDNA was 
subcloned into M13 vectors and the entire sequence, 1767 bases Including a 15 
base terminal poly (A) tract, has been determined by chain-terminator se¬ 
quencing. The sequence contained an open-reading frame that could encode a 
basic protein of mol.wt. 49700. From the predicted sequence it was apparent 
that the nucleocapsid protein has 5 basic regions, two of which are located 
near the middle of the sequence, a serine-rich region was also located, a 
feature which may be of functional importance as the nucleocapsid protein is 
phosphorylated at serine residues. The carboxy terminus of the nucleocapsid 
protein was found to be acidic. The 5' non-coding sequence contained a triple 
repeat of the pentamer AATCT, a structural feature which may play a signifi¬ 
cant role during the production of subgenomic viral nflNAs. 


INTRODUCTION 

Coronaviruses are enveloped, positive-stranded RNA viruses which infect 
vertebrates and are responsible for diseases of clinical and economic impor¬ 
tance, in particular respiratory and gastro-lntestlnal disorders*. The 
most-studied member of the group Is Murine Hepatitis Virus (MHV). Depending 
upon the age and genetic background of the host, different MHV strains 
display differences in virulence, pathogenicity and In organ and cell tro- 
plsms. They therefore provide useful models for a variety of disease pro¬ 
cesses. Of particular Interest is the neurotropic MHV strain, JHM, which has 
the ability to induce acute and subacute disorders of the central nervous 

system in rodents and has been used as a model for virus-induced demyell- 
2-4 

nation 

The Mouse Hepatitis Virus (MHV) genome is a linear, unsegmented, infec¬ 
tious RNA which is about 18 kilobases long. MHV replicates in the cytoplasm 
of infected cells and the viral genetic information is expressed as one 
genome-sized and six subgenomic mRNAs. The largest (genome-sized) nflNA is 
termed mRNA 1 and the smaller mRNAs are numbered in order of decreasing size. 
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These mRNAs are synthesized in non-equimolar amounts but in constant propor¬ 
tions. For a review of coronavlrus replication, see ref. 1. Analysis by RNase 

T. oligonucleotide fingerprinting and by hybridization to virus specific 
1 5-10 

cDNA probes , reveals that these mRNAs are 3'coterminal forming a 

"nested-set". Furthermore, analysis of RNase Tj oligonucleotides from sub- 

genomic mRNAs and from the equivalent regions of the genome suggests that the 

subgenomic mRNAs bear a leader sequence derived from the 5' end of the 
7 8 

genome ’ .The translation of IW mRNAs in vivo , in cell-free systems or in 
oocytesshows that each mRNA is translated independently to produce a 
single protein, the size of which corresponds to the coding capacity of the 
5' sequences not found in the next smallest mRNA. The smallest and most 
abundant mRNA (mRNA7) encodes the virion nucleocapsid protein, a basic, 
phosphorylated polypeptide of 50-60000 mol.wt. The next most abundant RNA, 
mRNA6, encodes the virion matrix glycoprotein(s) (23000 to 25000 mol.wt.) 
and the third major intracellular RNA, mRNA3, encodes the virion peplomer 
glycoprotein(s) (90000 and 180000 mol.wt.). The remaining Intracellular 
mRNAs, which are found in lesser amounts, are thought to encode viral 
non-structural proteins. 

In the experiments described here, we have isolated DNA copies of 
MHV-JHM mRNA7 and obtained a complete sequence of the mRNA. This nucleotide 
sequence and the predicted sequence of the encoded protein can be compared 
with the equivalent sequences recently obtained for the hepatotropic MHV 
strain, A59 14 . 

MATERIALS AND METHODS 
Chemicals 

Avian myeloblastosis virus reverse transcriptase was supplied by Life 
Sciences (St. Petersburg, Florida). Radiochemicals were supplied by Amersham 
Buchler (Braunschweig, F.R.G.). M13 pentadecamer sequencing and hybridi¬ 
zation probes, oligo dT^ and oligo dG^ were obtained from PL Bio¬ 
chemicals (St. Goar, F.R.G.). Escherichia coli DNA polymerase I (large frag¬ 
ment), SI nuclease and terminal deoxynucleotidyl transferase were obtained 
from Bethesda Research Laboratories (Neu-Isenburg, F.R.G). New England 
Nuclear (Dreieich, F.R.G.) supplied T4 DNA ligase. Restriction enzymes were 
obtained from PL Biochemicals, Bethesda Research Laboratories and Boehringer 
Mannheim (Mannheim, F.R.G.). 

Synthesis and Cloning of cDNA 

Polyadenylated RNA was isolated from Sac(-) cells that had been infect- 
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ed with MHV-JHM, as previously described . Double-stranded cDNA was pre¬ 
pared from 10 ^ug of polyadenylated RNA, using oligo dT and oligo dG to prime 
first and second strand synthesis, respectively, then oligo dC-tailed, 
double-stranded cDNA was annealed to Pstl-cleaved, oligo dG-tailed pAT153, 
according to protocols described by Land et al ^. Escherichia coll HB101 was 
transformed with the annealed plasmid/cDNA using the method of Dagert and 
Ehrlich^. Tetracycline-resistant, ampici 11 In-sensitive bacterial clones, 
from two independent cDNA cloning experiments, were screened for JHM-speci- 

fic sequences by hybridization with a single-stranded cDNA probe, containing 
32 

P, that had been copied from genome RNA isolated from purified virions. 
Characterization of cloned cDNA 

The size of inserts in plasmids from strongly-hybridizing clones was 

determined by gel electrophoresis of DNA extracted by the method of Holmes 
18 

and Quigley . Plasmids containing the largest inserts were prepared from 1 

litre cultures and were purified by equilibrium centrifugation in ethidlum 

bromide/caesium chloride. Inserts were excised from the plasmids using PstI 

and were recovered from agarose gels by electroelution. The inserts were 

mapped by partial digestion with restriction enzymes of DNA that had been 
32 

labelled using P cordycepin triphosphate and terminal deoxynucleotidyl 
transferase. 

Nucleotide sequencing 

Fragments of two cDNA inserts were generated by a variety of restriction 

iq 

enzymes and cloned into the M13 vectors mp 8 and mp 9 . The fragnents were 

20 

sequenced using the chain terminator method of Sanger et al. . 77 * of the 
cDNA was sequenced on both strands, a further 14 % on different but over¬ 
lapping fragments of the same strand and the remainder was sequenced at least 
twice. Towards the end of the sequencing project specific clones were identi¬ 
fied by their hybridization to a panel of characterized M13 clones. M13 

21 

hybridization probes were prepared by the method of Hu and Messing . 
Translation in vitro. Hybrid-arrested translation and polyacrylamide gel 
electrophoresis 

Polyadenylated RNA from cells infected with MHV-JHM was translated in a 

15 

rabbit reticulocyte lysate as previously described . Hybrid-arrested trans¬ 
lation experiments, using purified cDNA insert and polyadenylated RNA from 

cells Infected with MHV-JHM, were performed according to the method of Pater- 
22 

son and Kuff . usinq a rabbit reticulocyte lysate. Translation products 
were analysed on 15 t polyacrylamide gels 23 . 
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Fig. 1 

Hybrid-arrested translation of MHV-JHM mRNA7 

Autoradiograph of the JS S methionine-labelled products synthesized in a rab¬ 
bit reticulocyte lysate and separated on a 15 % polyacrylamide-SDS gel. The 
samples translated were: (a) no added RNA, (b-f) 250 ng of polyadenylated, 
cytoplasmic RNA from cells Infected with MHV-JHM, and either 500 ng of the 
insert DNA (c and d) or 1000 ng of the insert DNA (e and f). Samples (c) and 
(e) were in the hybrid conformation and samples (d) and (f) were heated to 
melt the hybrids. The major products of 60000 and 23000 mol.wt. have been 
identified.as the nucleocapsld and matrix proteins respectively. Sample (m) 
contained ^C-labelled molecular weight markers (CFA626, Amersham Buchler, 
Braunschweig, F.R.G.). 


RESULTS 

Identification of cDNA clones as copies of irftNA7 

The largest JHM-specIfic cDNA clones from each of two cloning experi¬ 
ments (those within pMS38 and pSS38), were analyzed by digestion with re¬ 
striction enzymes, showing that together they contained enough sequence to 
represent a full-length copy of mRNA7 (about 1800 bases 24 ). The insert in 
pSS38 was about 1700 base pairs and was therefore sufficient to account for 
most of mRNA7. The insert in pMS38 contained 830 base pairs. Mapping of both 
inserts, by partial digestion with restriction endonucleases, showed that 
the Insert in pMS38 contained an extra 40 base pairs at one end. 

Confirmation that the cDNA insert in pSS38 represented the body of mRNA7 
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Fig. 2 

Diagram showing those restriction endonuclease sites, in the DNA copy of 
MHV-JHM mRNA 7, used for subcloning into M13 vectors. Solid arrows show the 
direction and extent of sequence obtained from each clone. Broken lines 
indicate the probable extent of each clone. Restriction endonuclease clea¬ 
vage sites are denoted by the symbols: f Haelll, * MspI, ♦ Ball, f Alul, 
A Sau3a and O PvuII. The box at the 3‘ terminus represents the poly(A) 
tract. 


was obtained by hybrid-arrested translation (Fig. 1). When hybridized to 

cytoplasmic, polyadenylated RNA from cells Infected with JHM, the insert 

specifically Inhibited the translation of a 60000 mol.wt. polypeptide that 

has been previously identified as the intracellular precursor of the virion 

15 

nucleocapsld protein produced from mRNA7 . Melting of the hybrids before 
translation restored the synthesis of nucleocapsld protein. 

Nucleotide sequence of cloned cDNA 

Fragments of both cONA Inserts were generated by various restriction 
enzymes (Fig. 2), cloned into M13 vectors and their sequences were deter¬ 
mined. The combined sequence of 1767 base pairs (contained between the oligo 
dC/dG tails made during the process of cloning the cDNA) is presented in Fig. 
3. A 5‘ non-coding sequence of 83 base pairs preceded the first AUG codon, 
which Initiated an open reading frame (1365 bases long) with the potential to 
encode a basic polypeptide of 455 amino acids (49700 mol.wt.). The first 
putative termination codon in this reading frame was found at nucleotide 1449 
and was followed by 301 base pairs of 3' non-coding sequence. A polyadenylate 
tract was found beginning at nucleotide 1753. The only other open-reading 
frame of greater than 100 bases was from nucleotides 361-771 (inclusive), 
which is potentially able to encode 137 amino acids. Within the 5' non-coding 
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(1434 ) 6AT GAC TCT AAT GTG TAAAGAGAAT6AATCCTAT6TCGGCACTCG6T66TAACCCCTCGC6A6AAAGTCGG6ATAGGACACTCTCTATCA6AAT 
(1528) GGAT6TCTTGCTGTCATAACA6ATAGA6AAGGTTGTGGCA6ACCCTGTATCAATTAGTTGAAAGA&ATTGCAAAATA6A6AATGTGT&AGA6AA6TTAG 
(1627) CAAGGTCCTACGTCTAACCATAAGAACG6C6ATAGGCGCCCCCTGG6AAGAGCTCACATCAGGGTACTATTCCTGCAATGCCCTAGTAAATGAAT6AAG 
(1726) TT6ATCATGGCCAATTGGAAGAATCACAAAAAAAAAAAAAAA 


Fig. 3 

Complete nucleotide sequence of the DNA copy of MHV-JHM mRNA7 (1767 nucleo¬ 
tides), including a 15 base long, terminal poly (A) tract. The predicted 
sequence of the encoded protein is also depicted. 


sequence, two large, RNase Tj-resistant oligonucleotides were found (at 
positions 26-50 and 54-82), the compositions of which were very similar to 

g 

those of the MHV-A59 oligonucleotides 10 and 19 , respectively. Strikingly, 
the larger oligonucleotide contained a triple repeat of the pentamer AATCT 
and this sequence was found within both, independently generated clones. 


5050 


Downloaded from http://nar.oxfordjournals.org/ at University of Bristol Library on March 20. 2015 




Nucleic Acids Research 


DISCUSSION 

Analysis of the sequence of MHV genome RNA and subgenomic mRNAs Is a way 
of predicting the primary structure of encoded polypeptides and, at the same 
time, identifying non-coding sequences which may be relevant to the regu¬ 
lation of the viral genes. A comparative analysis of the sequences of dif¬ 
ferent MHV strains, or mutants with altered phenotypes, will also be needed 
as a basis for further studies on MHV replication and pathogenicity. 

Three criteria define the cloned sequences we have described here as re¬ 
presenting the MHV-JHM nucleocapsid gene. Firstly, the 3' terminal location 
of this sequence on the genome (unpublished data) is consistent with the 
known gene order of MHV-JHM^. Secondly, the cloned DNA has the ability to 
specifically arrest the translation j_n vitro of mf(NA7 and, thirdly, the 
characteristics of the predicted polypeptide as a basic protein of 49700 
mol.wt. are consistent with the electrophoretic properties of the nucleocap¬ 
sid protein In two-dimensional, non- equilibrium, pH gradient, gel electro- 
29 

phoresis . We believe that the cDNA sequence represents a complete copy of 
mRNA7 because it extends from a polyadenylate tract to a 5' terminal sequence 
UAUAAG, which is very similar to the cap-NUAAG sequence found by direct 
sequencing at the 5' terminus of mRNAs of the related MHV-A59®. 

Examination of the sequence of MHV-JHM nflNA7 reveals several interest¬ 
ing features. The presence of the two large RNase T,-resistant ollgonucleo- 

A 8 

tides (with similar base compositions to oligonucleotides 10 and 19 of A59 ) 

within the 5' non-coding sequence supports the proposal that MHV subgenomic 

mRNAs contain a leader sequence ’ . The A59 oligonucleotide 10 (equivalent 

to the JHM oligonucleotide at position 26 - 50) is found in all subgenomic 

mRNAs and In genomic RNA. It is not, however, found in 3' terminal fragments 

of the genome corresponding to mRNA7 and so must be nearer the 5' end of the 

genome. A59 oligonucleotide 19 (equivalent to the JHM oligonucleotide at 

position 54 - 82) is found only In mRNA7, but not in the equivalent region of 

the genome, and related oligonucleotides are found In other mRNAs. It has 

therefore been suggested that oligonucleotide 10 is contained within a 

leader sequence 7 derived from the 5' end of the genome and that oligonucleo- 

7 8 

tide 19 is formed by fusion of the leader to the body of mRNA7 ’ . The 
relative positions of the equivalent JHM oligonucleotides are consistent 
with this proposal. It is Interesting to note that the JHM equivalent of 
oligonucleotide 19 contained the triple repeat of the pentamer AATCT. It Is 
likely that such a striking feature has a significant functional role In the 
production of subgenomic nftNAs. MHV is known to replicate in the absence of 
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27 28 

the host cell nucleus ’ . It has also been shown that the target size for 

the UV Inactivation of the synthesis of each MHV mRNA is identical to the 

29 

physical size of the otf?NA . Thus the mRNAs cannot be produced by splicing of 
larger precursors. It has been suggested^ that the mRNAs could be produced by 
extension of a virus-encoded RNA primer or by a previously undescribed, 
unusually-specif1c polymerase jumping mechanism. 

From the sequence alone, it is difficult to speculate about the impor¬ 
tance of specific features of the nucleocapsid protein structure, at least 
until more Is known about the interactions between nucleocapsid protein and 
genome RNA and between the rlbonucleoprotein complex and other virion pro¬ 
teins. Particularly Intriguing is the specific interaction of nucleocapsid 
protein with genome RNA but not with subgenomic nflNAs. The MHV-JHM nucleocap¬ 
sid protein does not contain the clusters of lysine residues seen in the 

30 

N-termlnal portion of the capsid protein of Semliki Forest Virus so, in 

this respect. It Is similar to the nucleocapsid protein of influenza virus'**. 

Some regions enriched in basic amino acids are however apparent (e.g. arg-43 

to lys-50, lys-101 to lys-113, arg-196 to lys-230, arg-264 to arg-290 and 

lys-392 to lys-405) and, as in the nucleocapsid protein of Semliki Forest 

Virus, the carboxy terminus is acidic. The region ser-194 to ser-220 contains 

9 serine residues, or 24 % of the total serine content within 6 % of the 

coding sequence. Moreover, this Is a basic region of the protein. Another 4 

serines are found In the five residues from ser-12 to ser-16. This clustering 

of serine residues may be significant in view of the fact that JHM nucleocap- 

32 

sid protein Is phosphorylated specifically at serine residues . 

The sequence of MHV-JHM mRNA7 can also be compared with the recently 
published sequence of the nucleocapsid gene of MHV-A59***. The major dif¬ 
ferences between the JHM coding sequence reported here and the sequence 
reported for MHV-A59*^ are that the JHM sequence contains an additional base 
(nucleotide 408) and lacks a base (after nucleotide 725). However, It is now 
clear (J. Armstrong, personal communication) that the A59 sequence should be 
identical to the JHM sequence at these positions. Therefore, the two sequen¬ 
ces are very similar, with a 94 l overall homology within the coding 
sequence. This finding Is in accord with the degree of homology based on 

hybridization kinetics of cONA, copied from MHV-A59 mRNA7, with mRNA from 

28 

cells infected with MHV-JHM . Although the sequence homology Is so high 
overall, it Is not constant throughout the length of the coding sequence. 
Between nucleotides 497 and 569 the homology falls to 63 X and in a sequence 
of 23 bases (nucleotides 1271-1293) near the 3' end of the coding sequence. 
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only 9 bases are coimion to both strains. Also, an extra glutamine codon is 
found In the JHM sequence at nucleotide 1227. 

The non-coding regions of mRNA7 from MHV-JHM and those reported for 

MHV-A59^ may also be compared. The 5' non-coding sequences are identical for 

the 17 nucleotides immediately before the AUG codon but the rest of the JHM 

14 

5' non-coding sequence Is different from the sequence reported for A59 . 

More recent studies on the sequence of A59 subgenomic messengers (J. Arm¬ 
strong, personal cotmiuni cation) have shown that the reported 5' non-coding 
sequence^ is not derived from nflNA7 and therefore further comparison cannot 
be made. When the 3' non-coding sequences are compared, they are found to be 
highly conserved (98 * homology). We believe that this conservation of se¬ 
quence at the termini of MHV RNA is significant and is most likely related to 
the interaction of these molecules with RNA polymerase during the synthesis 
of negative stranded template as well as during the production of subgenomic 
mRNAs and the replication of genome RNA. 

In conclusion, this comparison reveals that, as might have been expect¬ 
ed, the sequence encoding the nucleocapsid protein of MHV Is relatively 
conserved, even between strains of MHV that show differences in pathogeni¬ 
city. Our imnedlate aim is to determine the level of sequence homology, 
between the same strains, for genes more directly involved in the Interaction 
between virus and the host cells. 


ACKNOWLEDGEMENTS 

We thank Helga Kriesinger for typing the manuscript and Dr. M. Carter 
for critical reading of the manuscript. This work was supported by the 
Deutsche Forschungsgmeinschaft. 


REFERENCES 

1. STddell, S., Wege, H. and ter Meulen, V. (1983). 

J. Gen. Virol. 64, 761-776. 

2. Knobler, R.L., Campert, P.W. and Oldstone, M.B.A. (1982). Nature 
298, 279-280. 

3. Wege, H., Koga, M., Watanabe, R., Nagashlma, K. and ter Meulen, V. 
(1983). Infect. Immun. 39, 1316-1324. 

4. Dubois-Dalcq, M.E., DoTTer, E.W., Haspel, M.V. and Holmes, K.V. 
(1982). Virology 119, 317-331. 

5. Leibowitz, J.L., Wilhelmsen, K.C. and Bond, C.W. (1981). Virol. 114 , 
39-51 

6. Lai, M.M.C., Brayton, P.R., Armen, R.C., Patton, C.D., Pugh, C. and 
Stohlman, S. (1981). J. Virol. 39, 823-834. 

7. Spaan, W.J.M., Rottier, P.J.M., Horzinek, M.C. and van der Zeljst, 
B.A.M. (1982). J. Virol. 42, 432-439. 


5053 


Downloaded from http://nar.oxfordjournals.org/ at University of Bristol Library on March 20, 2015 






Nucleic Acids Research 


8. Lai, Patton, C.D. and Stohlman, S.A. (1982). 

J. Virol. 41, 557-565. 

9. Cheley, S., Anderson, R. t Cupples, M.J., Lee Chan, E.C. and Morris, 
V.L. (1981). Virology 112, 596-604. 

10. Weiss, S.R. and Leibowltz, J.L. (1983). J. Gen. Virol. 64, 127-133. 

11. Siddell, S.G. (1983). J. Gen. Virol. 64, 113-125. 

12. Rottier, P.J.M., Spaan, W.J.M., Horzinek, M. and van der Zeljst, 
B.A.M. (1981). J. Virol. 38, 20-26. 

13. Lelbowitz, J.L., Weiss, S7R., Paavola, E. and Bon, C.W. (1982). J. 
Virol. 43, 905-913. 

14. Armstrong, J.A., Smeekens, S. and Rottier, P. (1983). 

Nucl. Ac. Res. 11, 883-891. 

15. Siddell, S.G., TJige, H., Barthel, A. and ter Meulen, V. (1980). J. 
Virol. 33, 10-17. 

16. Land, H., Grez, M., Hauser, H., Lindenmaier, W. and SchLitz, G. 
(1983). In Methods in Enzymoloqy , eds. Wu, R., Grossman, L. and 
Moldave, K. (Academic Press, New York), Vol. 100, In press. 

17. Dagert, M. and Ehrlich, S.D. (1979). Gene 6, 23-28. 

18. Holmes, D.S. and Quigley, M. (1981). Anal. Biochem. 114 , 193-197. 

19. Messing, J. and Vierira, J. (1982). Gene 19, 269-276. 

20. Sanger, F., Nicklen, S. and Coulson, A.R. - (1977). 

Proc. Natl. Acad. Sci. USA 74, 5463-5467. 

21. Hu, N. and Messing, J. (19877- Gene 17, 271-277. 

22. Paterson, B.M., Roberts, 8.E. and KuTF, E.L. (1977). 

Proc. Nat. Acad. Sci. USA 74, 4370-4374. 

23. Laemmli, U.K. (1970). Nature (London) 227, 680-685. 

24. Wege, H., Siddell, S., Sturm, M. and ter Meulen, V. (1981). J. Gen. 
Virol. 54, 213-217. 

25. Siddell, S., Wege, H., Barthel, A. and ter Meulen, V. (1981). J. Gen. 
Virol. 53, 145-155. 

26. Cheley, - S., Morris, V.L., Cupples, M.J. and Anderson, R. (1981). 
Virology 115, 310-321. 

27. Brayton, P.R., Ganges, R.G. and Stohlman, S.A. (1981) 

J. Gen. Virol. 56, 457-460. 

28. Wllhelmsen, K.QTT Lelbowitz, J.L., Bond, C.W. and Robb J.A..(1981) 
Virology 110, 225-230. 

29. Jacobs, L., Spaan, W.J.M., Horzinek, M.C. and van der Zeljst, B.A.M. 
(1981) J. Virol. 39, 401-406. 

30. Garoff, H., FriscFauf, A., Simons, K., Lehrach, H. and Delius, H. 
(1980). Proc. Natl. Acad. Sci. USA 77, 6376-6380. 

31. Winter, G. and Fields, S. (1981). Virology 114, 423-428. 

32. Siddell, S.G., Barthel, A. and ter Meulen, V. (1981). J. Gen. Virol. 
52, 235-243. 


5054 


Downloaded from http://nar.oxfordjournals.org/ at University of Bristol Library on March 20. 2015 





